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ABSTRACT 

Internet-based voice transmission, digital telephony, intercoms, telephone answering machines, and mass 
storage, we need to compress audio signals. ADPCM is one of the techniques to reduce the bandwidth in voice 
communication. More frequently, the smaller file sizes of compressed but lossy formats are used to store and 
transfer audio. Their small file sizes allow faster Internet transmission, as well as lower consumption of space 
on memory media. However, lossy formats trade off smaller file size against loss of audio quality, as all such 
compression algorithms compromise available signal detail. This paper discusses the implementation of 
ADPCM algorithm for audio compression of .wav file. It yields a compressed file 'A of the size of the original 
file. The sound quality of the audio file is maintained reasonably after compression. 
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I. INTRODUCTION 

Different compression techniques for still pictures include horizontal repeated pixel compression (pcx 
format), data conversion (gif format), and fractal path repeated pixels. For motion video, compression is 
relatively easy because large portions of the screen don't change between each frame; therefore, only the 
changes between images need to be stored. Text compression is extremely simple compared to video and audio. 
One method counts the probability of each character and then reassigns smaller bit values to the most common 
characters and larger bit values to the least common characters. However, digital samples of audio data have 
proven to be very difficult to compress; as these techniques do not work well at all for audio data. The data 
changes often and no values are common enough to save sufficient space. The sampling frequencies in use 
today are in range from 8 kHz for basic speech to 48 kHz for commercial DAT machines. The number of 
quantizes levels is typically a power of 2 to make full use of a fixed number of bits per audio sample. The 
typical range of bits per sample is between 8 and 16 bits. The data rates associated with uncompressed digital 
audio are substantial. For audio data on a CD, for example, which is sampled at 44.1 kHz with 16 bits per 
channel for two channels, about 1.4 megabits per second are processed. A clear need exists for some form of 
compression to enable the more efficient storage and transmission of digital audio data. Although lossless 
audio compression is not likely to become a dominating technology, it may become a useful complement to 
lossy compression algorithms in some applications. This is because, lossless compression algorithms rarely 
obtain a compression ratio larger than 3:1, while lossy compression algorithms allow compression ratios to 
range up to 12:1 and higher[l]. For lossy algorithms, as compression ratio increases, final audio quality 
lowers. For digital music distribution over the Internet, some consumers will want to acquire the best possible 
quality of an audio recording for their high-fidelity stereo system. However lossy audio compression 
technologies may not be acceptable for this application. Voc File Compressions technique simply removes any 
silence from the entire sample. This method analyzes the whole sample and then codes the silence into the 
sample using byte codes. Logarithmic compression such as |i-law and A-law compression only loses 
information which the ear would not hear anyway, and gives good quality results for both speech and music. 
Although the compression ratio is not very high it requires very little processing power to achieve it. This 
method is fast and compresses data into half the size of the original sample. A Linear Predictive Coding (LPC) 
encoder compares speech to an analytical model of the vocal tract, then throws away the speech and stores the 
parameters of the best-fit model. The MPEG compression is lossy, but nonetheless can achieve transparent, 
perceptually lossless compression. It may be observed that differential encoding schemes take advantage of 
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sources that show a high degree of correlation from sample to sample. These schemes predict each sample 
based on past source outputs and only encode and transmit the differences between the prediction and the 
sample value. When a source output does not change greatly from sample to sample this means the dynamic 
range of the differences is smaller than that of the sample output itself. This allows the quantization step size 
to be smaller for a desired noise level, or quantization noise to be reduced for a given step size. 

II. ADPCM ALGORITHM 

The DVI Audio compression using ADPCM (ADPCM) algorithm was first described in an IMA 
recommendation on audio formats and conversion practices. ADPCM is a transformation that encodes 16 -bit 
audio as 4 bits (a 4:1 compression ratio). In order to achieve this level of compression, the algorithm maintains 
an adaptive value predictor, which uses the distance between previous samples to store the most likely value of 
the next sample. The difference between samples is quantized down to a new sample using an adaptive step- 
size. The algorithm in [11] suggests using a table to adapt this step-size to the analyzed data. 
ADPCM has become widely used and adapted, and a variant of the algorithm performs voice encoding on 
cellular phones. 
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Fig. 1 : (a) ADPCM encoding process Algorithm 
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Fig 2. ADPCM Encoder Block Diagram 
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Fig. 3: ADPCM encoding process Flowchart 
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Fig 4. ADPCM Decoder Block Diagram 
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Figure 1(a) shows a block diagram of the ADPCM encoding process. A linear input sample X(n) is 
compared to the previous estimate of that input X(n-l). The difference, d(n), along with the present step size, 
ss(n), is presented to the encoder logic. This logic, described below, produces an ADPCM output sample. This 
output sample is also used to update the step size calculation ss(n+l), and is presented to the decoder to 
compute the linear estimate of the input sample. The encoder accepts the differential value, d(n), from the 
comparator and the step size, and calculates a 4-bit ADPCM code. The following is a representation of this 
calculation in pseudo code: 

Let B3=B2 = B1 = BO = 0 

If (d(n) < 0) then B3 = 1 and d(n) = ABS(d(n)) 

If (d(n) >= ss(n)) then B2 = 1 and d(n) = d(n) -ss(n) 

If (d(n) >= ss(n) / 2) then Bl = 1 and d(n) = d(n) -ss(n) / 2 

If (d(n) >= ss(n) / 4) then BO = 1 

L(n) = (10002 * B3) + (1002 * B2) + (102 * Bl) + B0 

For both the encoding and decoding process, the ADPCM algorithm adjusts the quantizer step size based on 
the most recent ADPCM value. The step size for the next sample, n+1, is calculated with the equation, 

ss(n+l) = ss(n) * l.lM(L(n)) 

This equation can be implemented efficiently using two lookup tables [11]. First table uses the magnitude of 
the ADPCM code as an index to look up an adjustment factor. The adjustment factor is used to move an index 
pointer located in second table. The index pointer then points to the new step size. This method of adapting the 
scale factor with changes in the waveform is optimized for voice signals. When the ADPCM algorithm is reset, 
the step size ss(n) is set to the minimum value (16) and the estimated waveform value X is set to zero (half 
scale). Playback of 48 samples (24 bytes) of plus and minus zero (10002 and 00002) will reset the algorithm. It 
is necessary to alternate positive and negative zero values because the 

encoding formula always adds 1/8 of the quantization size. If all values are positive or negative, a DC 
component would be added that would create a false reference level. 



3. RESULTS 
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Figure 6 (a) Reset condition 
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Figure 6(b) encoder out put for orbitary input 
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Figure 6(e) decoder output for obituary input. 



III. CONCLUSION 

Implementation of ADPCM algorithm for audio compression resulted in a compressed file, l A of the 
size of the original file. The sound quality of the audio file is maintained reasonably after compression. The 
compression leads to better storage scheme which can further be utilized for many applications in 
communications. The system can be used for real time audio streaming in future streaming media, such as 
audio or video files sent via the Internet. Audio/video data can be transmitted from the server more rapidly 
with this system as interruptions in playback as well as temporary modem delays are avoided. The main 
application intended for such system is to real time streaming however it could be used in various other 
applications such as Voice Storage, wireless communication equipments etc. 
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